# Video Temporal Analysis
Qwen2.5 VL 3B Instruct GGUF
Qwen2.5-VL is the latest vision-language model in the Qwen family, featuring powerful visual understanding and multimodal processing capabilities.
Image-to-Text English
Q
unsloth
4,645
4
Qwen2.5 VL 7B Instruct AWQ
Apache-2.0
Qwen2.5-VL is a multimodal vision-language model launched by Tongyi Qianwen, featuring powerful image understanding and text generation capabilities.
Image-to-Text
Transformers English

Q
Benasd
226
7
Featured Recommended AI Models